CSE 190 Matthew
نویسنده
چکیده
The dataset to be analyzed is the dataset of Reddit submissions suggested in lecture. (http://snap.stanford.edu/data/webReddit.html). In this dataset, there are 132,308 entries for analysis. Of these entries, however, there are only 16,736 unique images, and on average, each was resubmitted approximately 7.9 times. The timespan across which these Reddit submissions were made is July 2008 to January 2013. This dataset will be used to yield any possible information concerning the relation between a post’s success, the number of posts concurrently being submitted to Reddit, and the time of day. Specifically, which feature makes a better predictor, in comparison to other plausible prediction features. Additionally, another goal is to discover a correlation between number of hourly posts and average score of all posts within the hour. Below is a graph plotting the amount of posts for each hour of the day, according to the datapoints. This chart will be referenced in greater detail later.
منابع مشابه
Sentiment Analysis Classification for Rotten Tomatoes Phrases on Kaggle
In the second assignment for CSE 190: Data Mining and Predictive Analytics, we apply some techniques to improve the accuracy of classifying Rotten Tomatoes phrase sentiments. General Terms Algorithms, Experimentation
متن کاملSpeech perception in simulated electric hearing exploits information-bearing acoustic change.
Stilp and Kluender [(2010). Proc. Natl. Acad. Sci. U.S.A. 107(27), 12387-12392] reported measures of sensory change over time (cochlea-scaled spectral entropy, CSE) reliably predicted sentence intelligibility for normal-hearing listeners. Here, implications for listeners with atypical hearing were explored using noise-vocoded speech. CSE was parameterized as Euclidean distances between biologic...
متن کاملCSE 190, Great ideas in algorithms: Expander graphs
Our interest however will be in constructing large but very sparse graphs (ideally with d = 3) for which h(G) ≥ c for some absolute constant c > 0. Such graphs are “highly connected” graphs. For example, the following lemma shows that by deleting a few edges in such graphs, we can only disconnect a few vertices. This is very useful for example in network design, where we want the failure of edg...
متن کاملDecision Support System Requirements Definition for Human Extravehicular Activity Based on Cognitive Work Analysis
The design and adoption of decision support systems within complex work domains is a challenge for cognitive systems engineering (CSE) practitioners, particularly at the onset of project development. This article presents an example of applying CSE techniques to derive design requirements compatible with traditional systems engineering to guide decision support system development. Specifically,...
متن کاملCigarette smoke induces IL-8, but inhibits eotaxin and RANTES release from airway smooth muscle
BACKGROUND Cigarette smoke is the leading risk factor for the development of chronic obstructive pulmonary disease (COPD) an inflammatory condition characterised by neutrophilic inflammation and release of proinflammatory mediators such as interleukin-8 (IL-8). Human airway smooth muscle cells (HASMC) are a source of proinflammatory cytokines and chemokines. We investigated whether cigarette sm...
متن کامل